• Àüü
  • ÀüÀÚ/Àü±â
  • Åë½Å
  • ÄÄÇ»ÅÍ
´Ý±â

»çÀÌÆ®¸Ê

Loading..

Please wait....

±¹³» ³í¹®Áö

Ȩ Ȩ > ¿¬±¸¹®Çå > ±¹³» ³í¹®Áö > Çѱ¹ÀÎÅͳÝÁ¤º¸ÇÐȸ ³í¹®Áö

Çѱ¹ÀÎÅͳÝÁ¤º¸ÇÐȸ ³í¹®Áö

Current Result Document :

ÇѱÛÁ¦¸ñ(Korean Title) ÀÛ¼ºÀÚ ºÐ¼®°ú CNNÀ» Àû¿ëÇÑ ¼Ò½º ÄÚµå ÀÛ¼ºÀÚ ½Äº° ÇÁ·¹ÀÓ¿öÅ©
¿µ¹®Á¦¸ñ(English Title) The Identification Framework for source code author using Authorship Analysis and CNN
ÀúÀÚ(Author) ½Å°ÇÀ±   ±èµ¿¿í   È«¼º»ï   ÇÑ¸í¹¬   Gun-Yoon Shin   Dong-Wook Kim   Sung-sam Hong   Myung-Mook Han  
¿ø¹®¼ö·Ïó(Citation) VOL 19 NO. 05 PP. 0033 ~ 0041 (2018. 10)
Çѱ۳»¿ë
(Korean Abstract)
ÃÖ±Ù ÀÎÅÍ³Ý ±â¼úÀÌ ¹ßÀüÇÔ¿¡ µû¶ó ´Ù¾çÇÑ ÇÁ·Î±×·¥µéÀÌ ¸¸µé¾îÁö°í ÀÖ°í ÀÌ¿¡ µû¶ó ´Ù¾çÇÑ ÄÚµåµéÀÌ ¸¹Àº »ç¶÷µéÀ» ÅëÇØ ¸¸µé¾îÁø´Ù. ÀÌ·¯ÇÑ Ãø¸éÀ» ÀÌ¿ëÇÏ¿© ƯÁ¤ ÀÛ¼ºÀÚ°¡ ÀÛ¼ºÇÑ ÄÚµåµé ±×´ë·Î °¡Á®°¡ ÀÚ½ÅÀÌ ÀÛ¼ºÇÑ °Íó·³ º¸¿©Áְųª, Âü°íÇÑ ÄÚµåµé¿¡ ´ëÇÑ Á¤È®ÇÑ Ç¥±â ¾øÀÌ ±×´ë·Î »ç¿ëÇÏ¿© ÀÌ¿¡ ´ëÇÑ º¸È£°¡ Á¡Â÷ ¾î·Á¿öÁö°í ÀÖ´Ù. µû¶ó¼­ º» ³í¹®¿¡¼­´Â ÀÛ¼ºÀÚ ºÐ¼® À̷аú ÇÕ¼º°ö ½Å°æ¸Á ±â¹Ý ÀÚ¿¬¾î ó¸® ¹æ¹ýÀ» Àû¿ëÇÑ ÀÛ¼ºÀÚ ½Äº° ÇÁ·¹ÀÓ¿öÅ©·ê Á¦¾ÈÇÑ´Ù. ÀÛ¼ºÀÚ ºÐ¼® ÀÌ·ÐÀ» Àû¿ëÇÏ¿© ¼Ò½º Äڵ忡¼­ ÀÛ¼ºÀÚ ½Äº°¿¡ ÀûÇÕÇÑ Æ¯Â¡µéÀ» ÃßÃâÇÏ°í À̸¦ ÅؽºÆ® ¸¶À̴׿¡¼­ »ç¿ëÇÏ°í Àִ Ư¡µé°ú °áÇÕÇÏ¿© ±â°èÇнÀ ±â¹ÝÀÇ ÀÛ¼ºÀÚ ½Äº°À» ¼öÇàÇÑ´Ù. ±×¸®°í ÇÕ¼º°ö ½Å°æ¸Á ±â¹Ý ÀÚ¿¬¾î ó¸® ¹æ¹ýÀ» ¼Ò½º Äڵ忡 Àû¿ëÇÏ¿© ÄÚµå ÀÛ¼ºÀÚ ºÐ·ù¸¦ ¼öÇàÇÑ´Ù. º» ³í¹®¿¡¼­´Â ÀÛ¼ºÀÚ ºÐ¼® À̷аú ÇÕ¼º°ö ½Å°æ¸ÁÀ» Àû¿ëÇÑ ÀÛ¼ºÀÚ ½Äº° ÇÁ·¹ÀÓ¿öÅ©¸¦ ÅëÇØ ÀÛ¼ºÀÚ¸¦ ½Äº°Çϱâ À§Çؼ­´Â ÀÛ¼ºÀÚ ½Äº°¸¸À» À§ÇÑ Æ¯Â¡µéÀÌ ÇÊ¿äÇÏ´Ù´Â °Í°ú ÇÕ¼º°ö ½Å°æ¸Á ±â¹Ý ÀÚ¿¬¾î ó¸® ¹æ¹ýÀÌ ¼Ò½º ÄÚµåµî°ú °°Àº Ư¼öÇÑ Ã¼°è¸¦ °®Ãß°í ÀÖ´Â ¾ð¾î¿¡¼­µµ Àû¿ëÀÌ °¡´ÉÇÏ´Ù. ½ÇÇè °á°ú ÀÛ¼ºÀÚ ºÐ¼® ÀÌ·Ð ±â¹Ý ÀÛ¼ºÀÚ ½Äº° Á¤È®µµ´Â 95.1%¿´À¸¸ç CNNÀ» Àû¿ëÇÑ °á°ú ¹Ýº¹È½¼ö°¡ 90¹ø ÀÌ»óÀÏ °æ¿ì 98% ÀÌ»óÀÇ Á¤È®µµ¸¦ º¸¿©Áá´Ù.
¿µ¹®³»¿ë
(English Abstract)
Recently, Internet technology has developed, various programs are being created and therefore various codes are being made through many authors. On this aspect, some author deceive a program or code written by other particular author as they make it themselves and use other writers' code indiscriminately, or not indicating the exact code which has been used. Due to this makes it more and more difficult to protect the code. In this paper, we propose author identification framework using Authorship Analysis theory and Natural Language Processing(NLP) based on Convolutional Neural Network(CNN). We apply Authorship Analysis theory to extract features for author identification in the source code, and combine them with the features being used text mining to perform author identification using machine learning. In addition, applying CNN based natural language processing method to source code for code author classification. Therefore, we propose a framework for the identification of authors using the Authorship Analysis theory and the CNN. In order to identify the author, we need special features for identifying the authors only, and the NLP method based on the CNN is able to apply language with a special system such as source code and identify the author. identification accuracy based on Authorship Analysis theory is 95.1% and identification accuracy applied to CNN is 98%.
Å°¿öµå(Keyword) ÀÛ¼ºÀÚ ½Äº°   ÀÛ¼ºÀÚ ºÐ¼®   ÇÕ¼º°ö ½Å°æ¸Á   ±â°èÇнÀ   ÄÚµå ºÐ¼®   Author Identification   Authorship Analysis   Convolutional Neural Network   Machine Learning   Code Analysis  
ÆÄÀÏ÷ºÎ PDF ´Ù¿î·Îµå